Pretest estimation in combining probability and non-probability samples
نویسندگان
چکیده
Multiple heterogeneous data sources are becoming increasingly available for statistical analyses in the era of big data. As an important example finite-population inference, we develop a unified framework test-and-pool approach to general parameter estimation by combining gold-standard probability and non-probability samples. We focus on case when study variable is observed both datasets estimating target parameters, each contains other auxiliary variables. Utilizing design, conduct pretest procedure determine comparability with decide whether or not leverage pooled analysis. When comparable, our combines efficient estimation. Otherwise, retain only also characterize asymptotic distribution proposed estimator under local alternative provide data-adaptive select critical tuning parameters that smallest mean square error estimator. Lastly, deal non-regularity estimator, construct robust confidence interval has good finite-sample coverage property.
منابع مشابه
Pretest probability assessment derived from attribute matching
BACKGROUND Pretest probability (PTP) assessment plays a central role in diagnosis. This report compares a novel attribute-matching method to generate a PTP for acute coronary syndrome (ACS). We compare the new method with a validated logistic regression equation (LRE). METHODS Eight clinical variables (attributes) were chosen by classification and regression tree analysis of a prospectively c...
متن کاملNon-zero probability of nearest neighbor searching
Nearest Neighbor (NN) searching is a challenging problem in data management and has been widely studied in data mining, pattern recognition and computational geometry. The goal of NN searching is efficiently reporting the nearest data to a given object as a query. In most of the studies both the data and query are assumed to be precise, however, due to the real applications of NN searching, suc...
متن کاملOne-Class Classification by Combining Density and Class Probability Estimation
One-class classification has important applications such as outlier and novelty detection. It is commonly tackled using either density estimation techniques or by adapting a standard classification algorithm to the problem of carving out a decision boundary that describes the location of the target data. In this paper we present a simple method for one-class classification that combines the app...
متن کاملCombining Probability Forecasts
Linear pooling is by the far the most popular method for combining probability forecasts. However, any nontrivial weighted average of two or more distinct, calibrated probability forecasts is necessarily uncalibrated and lacks sharpness. In view of this, linear pooling requires recalibration, even in the ideal case in which the individual forecasts are calibrated. Toward this end, we propose a ...
متن کاملAcceptable random variables in non-commutative probability spaces
Acceptable random variables are defined in noncommutative (quantum) probability spaces and some of probability inequalities for these classes are obtained. These results are a generalization of negatively orthant dependent random variables in probability theory. Furthermore, the obtained results can be used for random matrices.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Electronic Journal of Statistics
سال: 2023
ISSN: ['1935-7524']
DOI: https://doi.org/10.1214/23-ejs2137